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-continued 
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CTG 
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CTG 


GCC 
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GCC 


ACC 


CTC 


TCG 


CAO 


ACC 


ATA 


GTG 


CCC 


ATC 


TGC 


CTC 


CCG 


OAC 


AGC 


GGC 


CIT 


GCA 


GAG 


COC 


GAG 


CTC 


AAT 


CAG 


GCC 


GGC 


CAG 


GAG 


ACC 


CTC 


GTG 


ACG 


GGC 


TOO 


GGC 


TAC 


CAC 


AGC 


AGC 


COA 


GAG 


AAG 


GAG 


GCC 


AAG 


AGA 


AAC 


CGC 


ACC 


TTC 


GTC 


CTC 


AAC 


TTC 


ATC 


AAG 


ATT 
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GTG 


GTC 


CCG 


CAC 


AAT 
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GOG 
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GAC 
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GCC 


CCC 


CAG 


AAG 


AGC 


TOO 


OCA 


ccr 


TAO-y 























wherein 
A is deoxyadenyl, . 
G is deoxygoanyU 



R is 5'-GCC CAC CAG GTG CTG CGG ATC 

CGC AAA CGT-3', 
¥^ must necessarily be 



5'-ATG TGG CAO CTC ACA AGC CTC CTG CTG TTC GTG 
GCC ACC TGG GOA ATT TCC GGC ACA CCA OCT CCT 
CTT GAC TCA OTO TTC TCC AGC AGC GAG CGT^3'; 



C is deoxycytidyl, 
T is tbymidyl, 

R is 5'-GCC CAC CAG GTG CTG CGG ATC 
CGC, AAA CGTO' or y-CAC CAG GTG CTG 



and that when 
R is 5'-CAC CAG GTG CTG CGG ATC CGC 

AAA CGT.3', 
Ri must necessarily be 



y-ATO TOG CAG CTC ACA AGC CTC CTG CTG TTC GTG 
GCC ACC TOO OGA ATT TCC GCC ACA CCA OCT CCT 
CTT OAC. TCA GTG TTC TCC AGC AGC GAG COT GCO-3'. 



CGG ATC CGC AAA CGT-3' The compounds of the present invention encode 

R^ is human protein C, and the heretofore unknown amino 
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ACA 
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CTC 


CTG 


CTG 


TTC 
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ATT 
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OCT 


CCT 
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TCA 


GTG 


TTC 


TCC 


AGC 


AGC 


GAG 


COT— 3' 


or5'.ATO TOO 


CAG 


CTC 


ACA 


AGC 


CTC 


CTO 


CTO 


TTC 


OTO 


GCC ACC 


TOO 


OCA 


ATT 


TCC 


GGC 


ACA 


CCA 


OCT 


CCT 


CTT OAC 


TCA 


OTO 


TTC 


TCC 


AGC 


AGC 


GAG 


COT OCC-3' 



M is 0 or 1, and acid sequence of nascent human protein C when M and 

N is 0 or 1, N are 1. The amino acid sequence, numbered to facili- 

provided that when M is 0, N must necessarily also be 0; tate further discussion, of nascent human protein C is: 

and that when 



5 10 19 

H2N-MBT TRP OLN LEU THR SER LEU LEU PHE VAL ALA THR TRP GLY ILE 



20 Z5 30 

SER GLY THR PRO ALA PRO LEU ASP SER VAL PHE SER SER SER OLU ARG 

35 40 I 45 

ALA HIS GLN VAL LEU ARO ILE ARG LYS ARG ALA ASN SER PHE LEU GLU 

50 33 -i-V eo 

GLU LEU ARO HIS SER SER LEU OLU ARO OLU CYS ILE OLU OLU ILE CYS 

65 70 73 80 

ASP PHE GLU GLU ALA LYS GLU ILE PHE OLN ASN VAL ASP ASP THR LEU 

85 90 . 95 

ALA PHE TRP SER LYS HIS VAL ASP GLY ASP GLN CYS LEU VAL LEU PRO 

100 105 ' no 

LEU GLU HIS PRO CTS ALA SER LEU CTS CYS GLY HIS GLY THR CYS ILE 



4,775,624 



-contmued 

113 120 123 

ASP GLY ILE GLY SER PHE SER CYS ASP CYS ARG SER GLY TRP GLU OLY 

130 133 140 

ARG PHE CYS GLN ARO OLU VAL SER PHE LEU ASN CYS SER LEU ASP ASN 

145 150 155 160 

GLY GLY CYS THR HIS TYR CYS LEU OLU OLU VAL GLY TRP ARG ARG CYS 

165 185 190 

SER CYS ALA PRO GLY TYR LYS LEU GLY ASP ASP LEU LEU OLN CYS HIS 

180 183 190 

PRO ALA VAL LYS PHE FRO CYS OLY ARO PRO TRP LYS ARO MET GLU LYS 

199 200 203 

LYS ARO SER HIS LEU LYS ARO ASP THR OLU ASP OLN OLU ASP OLN VAL 

210 215 220 

ASP PRO ARO LEU ILE ASP GLY LYS MET THR ARO ARO OLY ASP SER PRO 

225 230 235 240 

TRP OLN VAL VAL LEU LEU ASP SER ^LYS LYS LYS^ LEU ALA CYS GLY ALA 

243 250 255 

VAL LEU ILE HIS FRO SER TRP VAL LEU THR ALA ALA HIS CYS MET ASP 

260 263 ' 270 

OLU SER LYS LYS LEU LEU VAL SRO LEU OLY OLU TYR ASP LEU ^ ARO ARG^ 

275 280 285 

TRP GLU LYS TRP GLU LEU ASP LEU ASP ILE LYS GLU VAL PHE VAL HIS 

290 295 300 

PRO ASN TYR SER LYS SER THR THR ASP ASN ASP ILE ALA LEU LEU HIS 

303 310 313 320 

LEU ALA GLN PRO ALA THR LEU SER OLN THR ILE VAL PRO ILE CYS LEU 

323 330 333 

PRO ASP SER GLY LEU ALA OLU ARO GLU LEU ASN OLN ALA OLY GLN OLU 

340 345 350 

THR LEU VAL THR OLY TRP GLY TYR HIS SER SER ARG GLU LYS GLU ALA 

353 360 365 

LYS ARO ASN ARO THR PHE VAL LEU ASN PHE ILE LYS ILE PRO VAL VAL 

370 375 380 

PRO HIS ASN OLU CYS SER OLU VAL MET SER ASN MET VAL SER OLU ASN 

383 390 393 400 

MET LEU CYS ALA OLY ILE LEU OLY ASP ARO OLN ASP ALA CYS OLU OLY 

405 410 415 

ASP SER GLY GLY PRO MET VAL ALA SER PHE HIS GLY THR TRP PHE LEU 

420 425 430 

VAL OLY LEU VAL SER TRP GLY GLU GLY CYS OLY LEU LEU HIS ASN TYR 

435 440 443 

OLY VAL TYR THR LYS VAL SER ARO TYR LEU ASF TRP ILE HIS OLY HIS 

450 433 460 

ILE ARO ASP LYS OLU ALA PRO OLN LYS SER TRP ALA PRO-COOH 



wherein 

H2N- is the amino-tenninusy 

— COOH is the cartx)xy-teniuini8» 

ALA is Alaninep 

ARO is Arginine, 

ASN is Asparagine, 

ASP is Aspartic acid, 

CYS is Cysteine, 

GLN is G]utamine» 

GLU is Glutamic Acid, 

GLY is Glycine. 

HIS is Histidine, 

ILE is Isoleucine, 



LEU is Leucine, 

LYS is Lysine, 

MET is Methionine, 

PHE is Phenylalanme, 
60 PRO is Proline, 

SER is Serine, 

THR is Threonine, 

TRP is Tryptophan, 

TYR is Tyrosine, and . 
65 VAL is Valine. 

The DNA compounds of the present invention are 

derived from cDNA clones prepared from human liver 

mRNA that encodes human protein C activity. In con- 



